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Deep Sequencing Analyses of DsiRNAs Reveal the 
Influence of 3' Terminal Overhangs on Dicing Polarity, 
Strand Selectivity, and RNA Editing of siRNAs 

Jiehua Zhou 1 , Min-Sun Song 1 , Ashley M Jacobi 2 , Mark A Behlke 2 , Xiwei Wu 3 and John J Rossi 14 

25/27 Base duplex RNAs that are substrates for Dicer have been demonstrated to enhance RNA interference (RNAi) potency 
and efficacy. Since the target sites are not always equally susceptible to suppression by small interfering RNA (siRNA), not 
all 27-mer duplexes that are processed into the corresponding conventional siRNAs show increased potency. Thus random 
designing of Dicer-substrate siRNAs (DsiRNAs) may generate siRNAs with poor RNAi due to unpredictable Dicer processing. 
Previous studies have demonstrated that the 3'-overhang affects dicing cleavage site and the orientation of Dicer entry. 
Moreover, an asymmetric 27-mer duplex having a 3" two-nucleotide overhang and 3'-DNA residues on the blunt end has been 
rationally designed to obtain greater efficacy. This asymmetric structure directs dicing to predictably yield a single primary 
cleavage product. In the present study, we analyzed the in vitro and intracellular dicing patterns of chemically synthesized 
duplex RNAs with different 3'-overhangs. Consistent with previous studies, we observed that Dicer preferentially processes 
these RNAs at a site 21-22 nucleotide (nt) from the two-base 3'-overhangs. We also observed that the direction and ability 
of human Dicer to generate siRNAs can be partially or completely blocked by DNA residues at the 3'-termimi. To examine 
the effects of various 3'-end modifications on Dicer processing in cells, we employed lllumina Deep sequencing analyses to 
unravel the fates of the asymmetric 27-mer duplexes. To validate the strand selection process and knockdown capabilities 
we also conducted dual-luciferase psiCHECK reporter assays to monitor the RNAi potencies of both the "sense" (S) and 
"antisense" (AS) strands derived from these DsiRNAs. Consistent with our in vitro Dicer assays, the asymmetric duplexes 
were predictably processed into desired primary cleavage products of 21-22-mers in cells. We also observed the trimming 
of the 3" end, especially when DNA residues were incorporated into the overhangs and this trimming ultimately influenced 
the Dicer-cleavage site and RNAi potency. Moreover, the observation that the most efficacious strand was the most abundant 
revealed that the relative frequencies of each "S" or "AS" strand are highly correlated with the silencing activity and strand 
selectivity. Collectively, our data demonstrate that even though the only differences between a family of DsiRNAs was the 3" 
two-nuclotide overhang, dicing polarity and strand selectivity are distinct depending upon the sequence and chemical nature 
of this overhang. Thus, it is possible to predictably control dicing polarity and strand selectivity via simply changing the 3'-end 
overhangs without altering the original duplex sequence. These optimal design features of 3'-overhangs might provide a facile 
approach for rationally designing highly potent 25/27-mer DsiRNAs. 
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Introduction 

RNAinterference(RNAi)isasequence-specificpost-transcrip- 
tional gene silencing process triggered by 21-25 nucleotide 
(nt) small interfering RNAs (siRNAs). In cells these siRNAs 
are generated by the ribonuclease III Dicer which processes 
these siRNAs from longer double-stranded RNAs. 1,2 In asso- 
ciation with Dicer, the cleaved small RNA products possess- 
ing a 5'-phosphate and 2-base 3' overhang are loaded into 
large multiprotein complexes termed RNA-induced silencing 
complexes (RISC) and one of the two strands is selected as a 
guide for the sequence-specific silencing of the complemen- 
tary target RNA. 2-5 The PAZ domain, an RNA-binding domain 



of Dicer which is also found in Argonaute proteins, specifi- 
cally recognizes the 3' end of single-stranded RNA, suggest- 
ing it can function as a module for anchoring the 3' end of 
the guide strand within the RISC. 56 For the Dicer-substrate 
siRNAs (DsiRNAs), the 3' overhang therefore 7-9 affects dic- 
ing polarity (binding of Dicer) as well as subsequent strand 
selectivity in RISC, 10-15 consequently influencing the overall 
RNAi efficiency. It was previously reported that chemically 
synthesized 29 base duplex short hairpin RNAs that are sub- 
strates for Dicer are more potent RNAi triggers than 1 9 base 
duplex short hairpin RNAs. 16 Similarly, DsiRNAs of 25-30 nt 
can be up to 100-fold more potent than conventional 21-mer 
duplexes targeted to the same sequence location. 17 This 
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increased potency might be attributed to the fact that Dicer- 
generated 21-23-mer siRNAs are more efficiently incorpo- 
rated into RISC through the physical association of Dicer with 
the TAR RNA-binding protein and Argonaute proteins. 

Dicer cleavage of a blunt ended 27-mer duplex generally can 
generate a variety of different 21-23-mers depending on its 
sequence parameters, 15,18 such as length/composition of the 
3'-terminus, 19 GC content, inverted repeats, etc. Furthermore, 
it has been demonstrated that siRNA efficacy is highly depen- 
dent on the target position 20 and RNAi potency is susceptible 
to shifting a 21-mer siRNA even by a single base along the 
target mRNA sequence. 21 The overall RNAi efficacy of DsiR- 
NAs critically depends on the composition and potency of the 
processing products. Thus acquiring a better understanding 
of DsiRNA designs is useful for enhancing RNAi efficacy. In 
this study, we show that it is possible to predictably control 
dicing polarity and strand selectivity via simply changing the 



sequences of the 3'-end overhangs without altering the origi- 
nal duplex sequence. Recently, an asymmetric 25/27-mer 
duplex having a 3' two-nucleotide overhang and two 3'-DNA 
residues on the blunt end has been rationally designed to 
obtain greater efficacy 22-24 This asymmetric structure directs 
dicing to predictably yield a single primary cleavage product. 

In the present study, a simple 5'-end labeled gel assay was 
performed to analyze in vitro dicing patterns of chemically 
synthesized symmetric or asymmetric 25 base pair duplex 
TNP03 (Homo sapiens transportin 3) RNAs with different 3' 
two-nucleotide overhangs. We observed that a series of het- 
erogeneous products were generated by Dicer cleavage from 
these substrates and Dicer preferentially processes to a site 
22 nt from the 3'-end of the substrates that have a two-base 
ribonucleotide overhang. The 3' two-base ribonucleotide 
overhangs can help orient Dicer on the substrates and facili- 
tate Dicer entry and cleavage. In contrast, the direction and 



Table 1 The 21-mer siRNAs and 27-mer Dicer-substrate siRNAs targeting 77VP03and the target sequences are listed 



TNP03 21-mer siRNA and 27-mer Dicer-substrate siRNA (DsiRNA) TNP03 target sequence (site l)(GC-natural match) 



3—CG 


CGACA TTGCAGCTCGTGTACCAG GC — 3 


Group I 


Symmetric 21/21 -mer 


Original 21 (UU)/21 (UU) 
Shifted 21 (UU)/21 (UU) 


5' 

3' yy 


CGACAUUGCAGCUCGUGUA UU 
GCUGUAACGUCGAGCACAU 

5' UGCAGCUCGUGUACCAGGC UU 
3' UU ACGUCGAGCACAUGGUCCG 




Asymmetric 25/27-mer 


25 (GC)/27 (UU) 


5' 

3 r uy 


CGACAUUGCAGCUCGUGUACCAGGC 
GCUGUAACGUCGAGCACAUGGUCCG 






25 (GC)/27 (tt) 


5' 
3' tt 


CGACAUUGCAGCUCGUGUACCAGGC 
GCUGUAACGUCGAGCACAUGGUCCG 




Symmetric 27/27-mer 


27 (UU)/27 (UU) 


5' 

3' yy 


CGACAUUGCAGCUCGUGUACCAGGC UU 
GCUGUAACGUCGAGCACAUGGUCCG 






27 (tt)/27 (tt) 


5' 
3' tt 


CGACAUUGCAGCUCGUGUACCAGGC tt_ 
GCUGUAACGUCGAGCACAUGGUCCG 






27 (UU)/27 (tt) 


5' 
3' tt 


CGACAUUGCAGCUCGUGUACCAGGC UU 
GCUGUAACGUCGAGCACAUGGUCCG 






27 (tt)/27 (UU) 


5' 

3' yy 


CGACAUUGCAGCUCGUGUACCAGGC tL 
GCUGUAACGUCGAGCACAUGGUCCG 


Group II 


Asymmetric 25/27-mer 


25 (gc)/27 (GC) 


5' 

3' CG 


CGACAUUGCAGCUCGUGUACCAG gc 
GCUGUAACGUCGAGCACAUGGUCCG 






25 (gc)/27 (tt) 


5' 
3' tt 


CGACAUUGCAGCUCGUGUACCAG gc 
GCUGUAACGUCGAGCACAUGGUCCG 






25 (gc)/27 (UU) 


5' 

3' yy 


CGACAUUGCAGCUCGUGUACCAG gc 
GCUGUAACGUCGAGCACAUGGUCCG 






25 (gc)/27 (AA) 


5' 

3' AA 


CGACAUUGCAGCUCGUGUACCAG gc 
GCUGUAACGUCGAGCACAUGGUCCG 






25 (gc)/27 (GG) 


5' 

3' GG 


CGACAUUGCAGCUCGUGUACCAG gc 
GCUGUAACGUCGAGCACAUGGUCCG 






25 (gc)/27 (CC) 


5' 

3' CC 


CGACAUUGCAGCUCGUGUACCAG gc 
GCUGUAACGUCGAGCACAUGGUCCG 



The sense strand is presented from 5' to 3' and is marked as black, while the antisense strand is presented 3' to 5' end is marked as gray. Two-nucleotide 
3'-overhangs are underlined. Ribonucleotides are upper case and deoxyribonucleotides are in lower case. These RNAs are named by their strand length 
and overhangs: the number indicates the length of RNA strands; "(NN)" means the two-base 3' ends. Starting with a 19-mer siRNA sequence, shown as the 
original 21 (UU)/21 (UU) mer, these 27-mer DsiRNAs have are extended by six bases upstream of the target sequence. 
DsiRNA, Dicer-substrate small interfering RNA; TNP03, Homo sapiens transportin 3; siRNA, small interfering RNA. 
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ability of human Dicer to generate siRNAs can be partially 
or completely blocked by DNA residues at the 3'-termimi. 
Consistent with previous studies, combining of such features 
into an asymmetric DsiRNA (25/27-mer DsiRNA) provides an 
optimal design for obtaining a single, primary cleavage prod- 
uct with the best RNAi potency. 

To investigate the influence of the two-base 3' overhang 
on dicing of substrates in cells we employed lllumina Deep 
sequencing analyses to unravel the fates of the asymmetric 
25/27-mer TNP03 duplexes in HEK293 cells. To validate the 
strand selection process and target knockdown capabilities 
we also conducted dual-luciferase psiCHECK reporter assays 
to monitor the RNAi potencies of both the "S" and "antisense" 
(AS) strands derived from these DsiRNAs. In similarity to our 
in vitro Dicer assays, the asymmetric duplexes were predict- 
ably processed into the desired primary cleavage products of 
21-22 nts in cells. We also observed trimming of the 3' ends 
and some additional Uracils added as well. Our observation that 
the most efficacious strand was the most abundant revealed 
that the relative frequencies of each "S" or "AS" strand are 
highly correlated with the silencing activity and strand selec- 
tivity. A similar observation was also found in a separate pair 
of asymmetric 25/27-mer duplexes targeting heterogeneous 
nuclear ribonucleoprotein H (hnRNP H1), further validating 
the role of the sequence composition of 3' double-nucleotide 
overhang and the proposed optimal design features. 

Taken together, our data demonstrate that even though the 
only differences between a family of DsiRNAs was the 3' two- 
nucleotide overhang, dicing polarity and strand selectivity are 
distinct depending upon the sequence and chemical nature of 
this overhang. Thus, it is possible to predictably control dicing 



polarity and strand selectivity via simply changing the 3'-end 
overhangs without altering the original duplex sequence. These 
optimal design features of 3'-overhangs provide a facile approach 
for rationally designing highly potent 27-mer DsiRNAs. 



Results 

Design of DsiRNAs against TNP03 and in vitro Dicing 
approaches 

We designed and synthesized a series of 25 base pair, 
two-base overhang containing RNAs targeting the mRNA 
produced from the TNP03 gene {Homo sapiens transpor- 
ts 3) which is one of the many HIV-1 dependency factors 25 
(Table 1).The original 21 (UU)/21 (UU) siRNA was previously 
reported to efficiently knockdown target gene expression. 25 
The shifted 21 -mer siRNAs were also designed to target sites 
shifted downstream of the original 21 -mer siRNA in incre- 
ments of 6 nt. Based upon the original 21 -mer sequence, we 
designed the 27-mers with an added six bases of the TNPO 
sequence The various arrangements of 3' two-base over- 
hangs are labeled as groups I and II, which include four sym- 
metric and six asymmetric DsiRNAs as listed in Table 1 . 

To evaluate the effects of the 3'-overhang in determining the 
position and pattern of Dicer cleavage, P 32 -end labeled duplexes 
were processed in wfrowith recombinant human Dicer, and the 
dicing products were electrophoresed in denaturing polyacryl- 
amide gels and visualized by autoradiography. Figure 1 depicts 
the two types of experimental approaches used (method A 
and B). The dicing patterns were designated by the direction 
of Dicer entry into the substrates. "L-R" indicates Dicer enters 



A Method A: 5'-end P 32 sense labeled 27-mer duplex 
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Figure 1 In vitro Dicer processing of (a) 5' P 32 -end labeled sense or antisense (b) duplexes. Two types of experimental approaches 
are displayed as method A and B, respectively. The dicing patterns are named by the direction of Dicer entering the duplex as left to right 
(L-R) and right to left (R-L). 
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Figure 2 In vitro Dicer cleavage of 5' P 32 -end labeled sense or antisense strands. Group I duplexes. The cleavage fragments that re- 
sult from dicing L-R and R-L were visualized autoradiographically following denaturing gel electrophoresis of the Dicer-cleavage products: 
(a) sense and (b) antisense. The two experimental approaches (method A and B) are defined in the legend to Figure 1 . 



from left to right, while the "R-L' model depicts Dicer processing 
from right to left. The proportions of the cleaved/uncleaved frag- 
ments that result from "L-R" and "R-L' dicing directions reflect 
the dicing efficiency for the 27-mer duplex. 

The 3-overhangs influence in vitro dicing patterns 

Initially, the products that result from in vitro digestion of group 
I duplexes were analyzed using 5'-end P 32 labeled RNAs and 
a denaturing gel electrophoresis assay. In method A, Dicer 
cleavage of the 5'-end P 32 S strand-labeled duplexes gen- 
erated two different sized S strand fragments (Figure 2a). 
For "L-R" the longer species 21-22 mer contained the P 32 , 
whereas for "R-L" short 4-5 mers from the 25-mer S or 6-7 
mers from the 27-mer AS strand, were produced, respectively. 
These results demonstrate that Dicer enters these 27-mer 
duplex RNAs by either "L-R" or "R-L." A comparable proportion 
of long and short species (the ratio of "L-R" versus "R-L" = 1 ) 



was observed in the 27 (UU)/27 (UU) duplex. However, the 27 
(tt)/27 (UU) duplex with a two-base deoxy ribonucleotide (tt) 
3' overhang on the S strand yielded greater amounts of long 
"L-R" fragments and a very small amount of "R-L' products 
(the ratio of "L-R" versus "R-L' »1). In the reverse overhang 
setting 27 (UU)/27 (tt) duplex, the ratio of "L-R" to "R-L" cleav- 
age products was <1, confirming that Dicer does not readily 
enter a duplex with dTdT 2-base overhangs. Furthermore, in 
comparison with the 27 (tt)/27 (tt) duplex, the 27 (UU)/27 (tt) 
duplex has a dominant "R-L' dicing polarity and the 27 (tt)/27 
(UU) duplex has a major "L-R" polarity, demonstrating that the 
3'-UU overhang facilitates Dicer binding and entry. A blunt end 
in the asymmetric duplexes (the 25 (GC)/27 (UU) and the 25 
(GC) /27 (tt) duplexes) seemed to show less dicing polarity. 

To further validate these observations, we also 5' P 32 
end-labeled the AS strand of the same 27-mer duplexes 
(method B) and preformed polyacrylamide denaturing gel 
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assays and autoradiography (Figure 2b). These 27-mer 
duplexes were also cleaved bidirectionally from both ter- 
mini. The 27 (UU)/27 (UU) duplex generated a comparable 
proportion of long and short species. In contrast, there was 
very little processing of the 27 (tt)/27 (tt) duplex due to its 
two 3'-dTdT overhangs. The 25 (GC)/27 (UU) and the 27 
(tt)/27 (UU) duplexes containing a 3'-UU overhang on the 
AS strand were readily processed from "L-R," primarily gen- 
erating short fragments (Figure 2b), consistent with the 
results observed when these duplexes were labeled on the 
S strand in which the long fragments were the primary prod- 
ucts (Figure 2a). Only a trace amount of the "R-L" products 
were observed following dicing of the above two duplexes 
due to the hindrance of the blunt end or 3'-dTdT overhang 
on the S strand. As observed in Figure 2a, the dicing pat- 
tern was completely reversed in the 27 (tt)/27 (UU) duplex 
and the 27 (UU)/27 (tt) duplexes. These results definitively 
demonstrate that the 3'-overhang orients Dicer entry and 
influences dicing preference, and DNA residues reduce the 
binding of Dicer. Interestingly, a heterogeneous collection of 
4-7-nt fragments was produced by dicing of the asymmetric 
27-mer duplexes (the 25 (GC)/27 UU and the 25 (GC)/27 
(tt)). These fragments may be derived from a second round 
of Dicer cleavage of these siRNAs. 

The 3'-overhangs affect RNAi potency 

To investigate the influence of 3'-overhangs on RNAi activ- 
ity, we measured the target knockdown efficacy of the group 
I duplexes using a quantitative real-time reverse transcrip- 
tion-PCR system (qRT-PCR) (Figure 3). The original 21-mer 
siRNA was more potent than the shifted-21-mer (62 versus 
40% knockdown at 50 nmol/l and 40 versus 15% knockdown 
at 10 nmol/l). Compared with the original 21-mer siRNA, the 
most potent knockdown was mediated by the 27 (tt)/27 (UU) 
duplex, providing 78 and 60% knockdown at 50 and 10 nmol/l, 
respectively. The 25 (GC) /27 (UU) and 27 (UU)/27 (UU) 
duplexes enhanced (-10%) the RNAi potency, whereas other 
27-mer duplexes were slightly less potent than the original 
21-mer siRNA. The in vitro Dicer-cleavage reactions for the 
27 (tt)/27 (UU) duplex predominantly generated "L-R" cleav- 
age products identical in size to the original 21-mer siRNA. In 
contrast, major "R-L" dicing products were produced from the 
27 (UU)/27 (tt) duplex. The "L-R' cleavage species generate 
the more effective siRNAs. 

The enhanced potency of the "L-R" cleavage might be attrib- 
uted to Dicer-generated products which result in preferential 
handoff of the AS strand to RISC. Correspondingly, the unfa- 
vorable cleavage species deriving from "R-L" dicing displayed 
reduced RNAi. These results suggest that the 27-mer duplex 
can yield a specific, desired 21-mer species which is able 
to enhance RNAi activity, even though the only differences 
among the group of 27-mer duplexes are the 3'-overhangs. 

Asymmetric DsiRNAs containing a single 3 -DNA blunt 
terminus improve RNAi potency 

The "L-R" and "R-L" dicing patterns could be in equilibrium 
and therefore compete with each other generating a mixture 
of siRNAs. By preventing one of the two dicing directionali- 
ties via an unfavorable 3'-overhang it is possible to promote 
a single Dicer entry pattern. Since the 3'-overhang plays a 
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Figure 3 Silencing of Homo sapiens transportin 3 (TNP03) 
by group I duplex Dicer-substrate small interfering RNAs 
(DsiRNAs). HEK 293 cells were transfected with 50 or 10 nmol/l 
of the experimental ant\-TNP03 duplex DsiRNAs. TNP03 mRNA 
levels were detected by quantitative real-time reverse transcription 
(qRT-PCR). The data are normalized with GAPDH mRNA levels 
and represent the average of three replicate assays. 

determinant role in orienting Dicer entry and RNAi efficiency, 
it is possible to generate the desired cleavage species via 
simply incorporating a favorable 3' overhang and an unfavor- 
able 3' blunt end without altering the original 27-mer duplex 
sequence. For group I, 27-mer duplexes, the presence of a 
3'-dTdT overhang or an RNA containing blunt end did not 
completely abolish Dicer bidirectional entry and different dic- 
ing patterns. To further restrict the dicing preference and entry 
orientation, we designed a series of asymmetric duplexes 
(group II in Table 1) with a single two-nucleotide 3'-overhang 
on the AS strand and two-nucleotide DNA residues at the 
3'-blunt end of S strand, to provide a single favorable Dicer 
entry and a restricted dicing pattern. 

In vitro Dicer processing of the group II duplexes was exam- 
ined as previously described (Figure 4b,c). As expected, the 
desired "L-R" products derived from all these asymmetric 
duplexes were the overwhelming majority indicating that the 
two-base deoxyribonucleotide 3' blunt end strongly impeded 
Dicer entry. When compared with the duplexes of group I, the 
asymmetric group II duplexes greatly simplified the in vitro 
dicing pattern, supporting the fact that terminal DNA residues 
impeded Dicer entry and can be used to direct Dicer entry 
onto a DsiRNA to obtain a single desired cleavage product. 

To further evaluate the asymmetric design, we evaluated the 
target knockdown efficacy of these RNA duplexes (Figure 4c) 
via a qRT-PCR assay. As we observed in the dicing reactions, 
the asymmetric duplexes with various 3' two-ribonucleotide 
overhangs had enhanced RNAi potency, whereas the duplex 
having a 3'-dTdT overhang had less efficient target knockdown 
efficiency. Moreover, the duplexes with a 3' two-ribonucleotide 
overhang from group II showed comparable knockdown effica- 
cies (Figure 4c) as well as dicing activities (Figure 4b). 

The 3-overhang sequences influence the guide strand 
selection of the asymmetric DsiRNAs 

To identify whether the sequence composition of the 3' 
two-base overhang is important for Dicer recognition and 
specificity, we used lllumina Deep sequencing analyses to 
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investigate the intracellular dicing products of the asymmetric 
25/27-mer duplexes transfected in HEK293 cells. At 48 hours 
post-transfection with 1 0 nmol/l of these DsiRNAs, total RNAs 
were isolated and prepared for lllumina sequencing. 

As shown in Table 2a, all the samples had similar total 
reads. From the total reads specific sequences were collected 
and aligned using the referenced sequences of the asymmet- 
ric DsiRNAs (AS or S strands). Surprisingly, the data demon- 
strated that the total reads of these RNA duplexes and the 
proportion of the AS strand to S strands were clearly distinct 



among the different duplexes (Figure 5a,b). Unequal trans- 
fection efficiencies or sample processing might cause such 
differences; however, the relative strand distributions suggest 
that the observed differences can largely be attributed to the 
3' overhang compositions. The relative abundance of AS to 
S sequences declined relative to the two-base overhangs as 
follows: GG > GC, AA > CC, UU » tt. For example, despite 
the close number of total reads for the 25 (gc)/27 (GG) duplex 
(477,396) and the 25 (gc)/27 (CC) duplex (431,423) these 
two duplexes have distinctly different strand distributions with 



Table 2 lllumina Deep sequence analyses and IC R0 values of asymmetric 27-mer TNPQ3 Dicer-substrate siRNAs (group II) 



All sequences 25(gc)/27 (GC) 


25(gc)/27 (tt) 


25(gc)/27 (UU) 


25(gc)/27 (AA) 


25(gc)/27 (GG) 


25(gc)/27 (CC) 


Cell alone 


(a) Total reads of deep sequences, fraction and abundance of sense and antisense strands from each duplex are listed. To calculate the fractions, reads for 
each strand were divided by the total number of sense or antisense reads. The strand distribution was calculated as the ratio of the abundance of antisense to 
sense. 


Total reads from deep sequence 28,068,694 


29,921,531 


30,384,765 


30,686,615 


29,867,099 


29,814,503 


24,949,154 


Total reads of antisense and sense 340,051 


201,128 


239,788 


358,466 


477,396 


431 ,423 


137 


Reads of antisense 229,347 


64,970 


135,285 


242,439 


418,187 


236,366 


63 


Reads of sense 1 1 0,704 


136,158 


104,503 


116,027 


59,209 


195,057 


74 


Percentage of antisense 67.445% 


32.303% 


56.419% 


67.632% 


87.598% 


54.788% 


45.985% 


Percentage of sense 32.555% 


67.697% 


43.581% 


32.368% 


12.402% 


45.212% 


54.015% 


as/s ratio 2.071 7 


0.4772 


1.2946 


2.0895 


7.0629 


1.2118 


0.8514 


Abundance of antisense GG > GC, CC, AA > UU » tt. 

Abundance of sense GG < GC, AA, UU < tt < CC. 

Ratio of antisense to sense GG > GC, AA > CC, UU » tt. 












Top 10 sequences 25(gc)/27 (GC) 


25(gc)/27 (tt) 


25(gc)/27 (UU) 


25(gc)/27 (AA) 


25(gc)/27 (GG) 


25(gc)/27 (CC) 




(b) Total reads of the top 10 sense and antisense strands from each duplex and the fraction of the desired "L-R" cleavage products from each strand are listed. 
L-R product reads from the sense or antisense strands were divided by the number of reads of each strand, respectively. 


Total reads of antisense and sense 237,338 


134,334 


183,025 


228,955 


318,968 


317,477 




Reads of antisense 1 47,544 


41 ,748 


94,741 


136,615 


274,202 


153,203 




"L-R" products from antisense 1 36,054 


4,862 


81,461 


115,259 


274,202 


145,567 




Percentage of "L-R" products 92.21 % 


11.65% 


85.98% 


84.37% 


100.00% 


95.02% 




Reads of Sense 89,794 


92,586 


88,284 


92,340 


44,766 


164,274 




"L-R" products from sense 88,380 


86,928 


87,604 


87,824 


42,409 


161,621 




Percentage of "L-R" products 98.43% 


93.89% 


99.23% 


95.11% 


94.73% 


98.39% 




Abundance of "L-R" products from Antisene: GG > GC, CC, AA > UU » tt. 
Abundance of "L-R" products from Sene: GG < GC, AA, UU, tt < CC. 










psiCHECK assay (RNAi potency) 25(gc)/27 (GC) 


25(gc)/27 (tt) 


25(gc)/27 (UU) 


25(gc)/27 (AA) 


25(gc)/27 (GG) 


25(gc)/27 (CC) 




(c) IC 50 values of asymmetric 27-mer duplexes were determined using the psiCHECK assay as described in Materials and Methods. When the antisense or 
sense strands were used as the "guide" strand, their target knockdown efficiency is listed as the IC 50 . The strand selectivity was calculated as the ratio of IC 50 
values of sense versus antisense targeting. 


IC 50 of antisense (pmol/l) 61 .34 ±1.18 


987.3 ± 1.31 


1 19.6 ± 1.30 


30.57 ± 1.18 


21.06 ± 1.12 


43.95 ± 1.21 




IC 50 of sense (pmol/l) 1 356 ± 1 .55 


2257 ± 1 .46 


307 ± 1 .39 


601.3 ± 1.24 


8252 ± 1 .60 


186.2 ± 1.30 




Selectivity of AS to S 22.11 


2.29 


2.57 


19.67 


391.83 


4.24 





RNAi potency of Antisense GG > AA, CC > GC > UU » tt. 
RNAi potency of Sense GG « tt < GC < AA < UU < CC. 

Selectivity of Antisense to Sense GG » GC, AA > CC > UU, tt. 

DsiRNA, Dicer-substrate small interfering RNA; TNP03, Homo sapiens transportin 3; siRNA, small interfering RNA. 



Figure 4 In vitro Dicer cleavage of 5' P 32 -end labeled sense or antisense strands. Group II asymmetric duplex Dicer-substrate small 
interfering RNAs (DsiRNAs) and RNA interference (RNAi) potency. The cleavage fragments that result from dicing L-R and R-L were 
visualized by autoradiography of the P 32 5' end-labeled strands: (a) sense and (b) antisense. Two experimental approaches (method A and 
B) and the dicing pattern are defined the legend to Figure 1. (c) Silencing of Homo sapiens transportin 3 (TNP03) triggered by group II 
duplex RNAs. HEK 293 cells were transfected with 50 or 10 nmol/l of the experimental anti- TNP03 duplex DsiRNAs. TNP03 mRNA level 
were detected by quantitative real-time reverse transcription (qRT-PCR).The data were normalized with GAPDH mRNA and represent the 
average of three replicate assays. 
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the relative ratios of AS to S being 7.06 and 1.21, respec- 
tively. Similarly, the 25 (gc)/27 (UU) duplex (239,788) and the 
25 (gc)/27 (tt) duplex (201 ,128) have a similar number of total 
reads, whereas their relative strand distributions are very dif- 
ferent. Therefore, these data demonstrate that the sequence 
composition of the 3' overhang plays an important role in 
determining the fates of these DsiRNAs with respect to the 
dicing pattern, RISC loading and strand stability. 

Manual inspection of the AS and S strands focusing only 
on the top 10 sequences, narrowed the list to 60-70% of 
the total reads (Table 2b), but this analysis still maintained 
the same propensity for relative abundance of AS/S (such 
as, GG > GC, AA > CC, UU » tt). We further identified 
the Dicer-cleaved RNA species for the top 10 sequences. 
Figure 5c and Supplementary Figure S1 display two 
major dicing patterns as previously determined: "L-R" 
cleavage generating the desired siRNAs and "R-L" cleav- 
age producing the undesired siRNAs. Consistent with the 
in vitro dicing assay, the asymmetric duplexes having a 3' 
two-ribonucleotide overhang were predictably processed 
into the desired "L-R" cleavage products of 21-22 mers in 
cells (Supplementary Figure S1). The population of "L-R" 
cleavage products is summarized in Table 2b. Approxi- 
mately 85% of the AS strands were preferentially gener- 
ated from "L-R" cleavage products. In contrast, the AS 
strand of the 25 (gc)/27 (tt) duplex only produced 1 1 .65% 
of the desired "L-R" cleavage species, which corresponds 
with the in vitro dicing results. 

Interestingly, we also observed RNA editing of both strands, 
such as trimming of a single or double-nucleotide on the 3' 
end or addition of untemplated nucleotides to the 3' or 5' ter- 
mini. As shown in Figure 5d and Supplementary Figure S1 , 
trimming of the 3' end occurred on both strands, especially 
when DNA residues were incorporated into the overhangs or 
the blunt end. Untemplated nucleotides were added to the 3' 
or 5'-termini, most likely following the trimming or dicing reac- 
tions. For example, the 3'-GC overhang on the AS strand of 
the 25 (gc)/27 (GC) duplex was trimmed by a single "C" and 
subsequently extended by a U or A (Supplementary Figure 
S1a, entry 7 and 8). For the 25 (gc)/27 (tt) duplex, after its 3' 
DNA (gc) blunt end on the S strand was trimmed, the result- 
ing RNA was subjected to uridylation of the 3' end (Supple- 
mentary Figure S1b S, entry 6, 7, and 10). Moreover, we 



Figure 5 lllumina Deep sequence analyses of asymmetric 
27-mer Homo sapiens transportin 3 (TNP03) Dicer-sub- 
strate siRNAs (group II). HEK 293 cells were transfected with 
10 nmol/l of the asymmetric group II RNA duplexes. Forty hours 
post-transfection the total RNAs were isolated and prepared for 
lllumina Deep sequencing. The data collection and alignment 
are described in Materials and Methods section, (a) The total 
reads and abundance of sense and antisense strands from each 
duplex, (b) The strand distribution was calculated as the ratio of 
the abundance of antisense to sense. The ratio of antisense (AS) 
to sense (S) is ranked by the 3' overhang GG > GC, AA > CC, 
UU » tt. (c) The dicing pattern L-R and R-L are as previously 
described. The L-R pattern generates the desired siRNA spe- 
cies for target knockdown. Total reads from the top 10 antisense 
strands and the abundance of L-R cleavage products from the 
antisense strand, (d) Two types of RNA editing: trimming of the 
3' end and post-transcriptional addition of nucleotides at the 3' 
or 5' ends. 



also found that nucleotides most frequently added to these 
duplexes were U and A. The extension of a U or A was much 
more prevalent on the 3' end than on the 5' end. Although 
the RNA editing appeared to be widespread among all the 
duplexes, the frequency is generally lower than 10% of the 
total reads. The highest frequency of editing exceeded 50% 
in the 25 (gc)/27 (GC) duplex, suggesting the 3' overhang 
composition also affects RNA extension and ultimately influ- 
ences the Dicer-cleavage site and RNAi potency. 

The 3-overhang sequence compositions influence strand 
selectivity and RNAi activity of asymmetric DsiRNAs 

It is known that selection of the guide strand depends upon 
the differences in the thermodynamic stability of the two ends 
of the 21/22-mer duplexes. 526 The 3' two-nucleotide over- 
hangs can alter the stability of the duplex ends, eventually 
affecting strand selection by the Argonaute proteins. Argo- 
naute 2 binds the guide strand in an oriented manner which 
facilitates cleavage of the passenger strand of the siRNA 
during RISC loading. 27 The potential for competition of the 
passenger strand for RISC entry necessitates proper design 
strategies to optimize the desired guide strand selection and 
to minimize off target activity by the undesired passenger 
strand. 

To evaluate the RNAi activities of the asymmetric duplexes, 
we used the psiCHECK reporter system in which the target 
sequence for the AS or S strands are inserted in the 3' UTR 
of the Renilla luciferase gene. The siRNA-mediated inhibition 
of luciferase activity for both the S and AS target orientations 
were independently tested (Figure 6a and Table 2c). The 
strand selectivity was calculated as a measure of the rela- 
tive target inhibition efficiencies for each target orientation 
(Figure 6b and Table 2c). When compared with the duplex 
harboring a 3' dTdT overhang, all the duplexes with 3' two 
base-ribonucleotide overhangs tested in this study showed 
superior inhibition of luciferase expression (IC 50 20-100 
pmol/l) when using the AS guide strand against the S target. 
However, when the S strands were used as guides for the AS 
orientation of the target, relatively lower efficiencies of target 
inhibition were observed (IC 50 200-8,000 pmol/l). The strand 
selectivity (Figure 6b) favoring the AS strand relative to the 
S strand as guides for these asymmetric duplexes demon- 
strated that the asymmetric design favors handoff to RISC of 
the AS strand as guide following dicing. Moreover, there was 
a profound bias in AS versus S strand function for the 3' GG 
overhang. The data revealed that the most efficacious strand 
was the most abundant and the relative frequencies of the "S" 
or "AS" strands are highly correlated with the silencing activ- 
ity and strand selectivity. In brief, the rankings for the RNAi 
efficiency for of the various 3' overhangs is GG > GC, CC, 
AA > UU » tt. Correspondingly, the abundance of the "L-R" 
products for the various 3' two-base overhangs is also GG > 
GC, CC, AA> UU » tt. 

Validation of the role of the two-base 3 -overhang for a 
different pair of asymmetric DsiRNAs 

To verify the role of the 3'-overhang in guide strand selec- 
tion and function, we analyzed a separate set of asymmetric 
DsiRNAs that target hnRNP H1. These two asymmetric DsiR- 
NAs differ by a single nucleotide, but have strikingly different 
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Figure 6 IC 50 values and strand selectivity of asymmetric 27-mer Homo sapiens transportin 3 (TNP03) Dicer-substrate siRNAs 
(group II). (a) IC 50 values of the asymmetric 27-mer duplexes were determined using the psiCHECK assays as described Materials and 
Methods section. When the antisense or sense was used as "guide" strand, the target knockdown efficiency is listed as the IC 50 . (b) The 
strand selectivity was calculated as the ratio of IC 50 values of the sense to antisense targets. 



functional activities, and thus represent an interesting test for 
the role of the two-base 3' overhangs (Table 3a). The RNAi 
potencies of these RNAs were also evaluated using the psi- 
CHECK system. As depicted in Table 3b, the knockdown effi- 
ciencies of both the S and AS strands of these asymmetric 
hnRNP H1 DsiRNAs were independently assessed and the 
strand selectivity was calculated. 

There is only a one-base shift along the target mRNA 
sequence between the site 325 DsiRNAs and the site 324 
DsiRNAs. The site 324 DsiRNAs showed better overall RNAi 
efficiency (combined knockdown of both S and AS targets) 
when compared with the site 325 DsiRNAs containing the 
same 3'-overhangs. The site 324 DsiRNAs showed a com- 
parable knockdown efficiency for either the S or AS strands 
as guides against their corresponding targets (Table 3b). 
Despite the weak strand selectivity for the 324 DsiRNAs, 
there was a relative tendency for biased strand selectivity for 
the 25 (gg)/27 (GG) duplexes compared to the 3'-CC, UU and 
AA overhangs. Substantial strand selectivity was observed 
for the site 325 DsiRNAs. For example, the selectivity of AS to 
S of the 25 (tg)/27 (GG) duplex is 6.73 and the 25 (tg)/27 (AA) 
duplex is 8.86, respectively. For the site 325 DsiRNAs it was 
interesting to see that the 3'-AA overhang provided the best 
RNAi activity and strongest strand selectivity. In this case, the 
3'-AA overhang makes the siRNA completely complementary 
to the hnRNP H1 target site, implying a perfectly matched 
3'-overhang probably contributes to enhanced RNAi activity. 
Additionally, we also chose two representative DsiRNAs from 
each site to evaluate the target knockdown efficacy via a qRT- 
PCR assay (Supplementary Figure S3). Consistent with our 
observation in the TNP03 case, the asymmetric duplexes 
with perfectly matched 3'-overhangs have enhanced RNAi 
potency, whereas the duplex having a 3'-dTdT overhang had 
less efficient target knockdown efficiency. For example, the 
site 325 DsiRNA 25 (tg)/27 (AA) showed a lower IC 50 value 
(44.37 pmol/l) compared with the 25 (tg)/27 (tt) duplex (IC 50 = 
284 pmol/l). 



We also used lllumina Deep sequencing analyses to 
interrogate the relative strand abundance of the hnRNP H1 
asymmetric 25/27-mer duplexes. Total RNAs were isolated 
and prepared for lllumina Deep sequencing at 24 hours post- 
transfection of these DsiRNAs in HCT116 cells. According 
to their RNAi potency and strand selectivity, we chose two 
representative DsiRNAs for each target site. For example, 
the site 324 DsiRNAs 25 (tg)/27 (GG) and 25 (tg)/27 (AA) 
that showed a big difference in RNAi activity and selectivity 
were selected for deep sequence analyses. Similarly, deep 
sequencing analyses were performed with the DsiRNA 325 
25 (gg)/27 (GG) and 25 (gg)/27 (CC) duplexes. To simplify the 
comparisons we only focused on the top 10 sequences. As 
shown in Table 3b and Figure 7a,b, the total reads of these 
RNA duplexes and the proportions of the AS to S strands 
were distinct among the experimental duplexes. In similarity 
to our previous observation with the TNP03 DsiRNAs, the 
relative abundances of the AS to S strands for both site 324 
and 325 DsiRNAs were consistent with the strand selectivity 
in the RNAi assays. For example, the relative distribution of 
AS to S (Figure 7b) ratios ranked the 25 (tg)/27 (GG) » 25 
(tg)/ 27 (AA) for site 324, and GG > CC for site 325 which is 
the same trend for strand selectivity shown in Table 3b. Thus 
these observations validated that the 3'-overhang composi- 
tion contributes to the fates of the DsiRNAs and ultimately 
RNAi activity. 

As previously described for the TNP03 DsiRNA system, 
the two major Dicer-cleavage products ("L-R" and "R-L") in 
the top 10 sequences were identified (Figure 7c and Sup- 
plementary Figure S2). Table 3c lists the population of "L-R" 
cleavage products producing the desired siRNA species. 
Consistent with the results obtained with the TNP03 sys- 
tem, >80% of the AS strands or S strands of the hnRNP H1 
DsiRNAs preferentially produced the desired primary "L-R" 
cleavage products. As shown in Supplementary Figure S2, 
RNA editing took place at the ends of both strands as previ- 
ously observed. For example, trimming of a single or double- 
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Table 3 Deep sequence analyses and IC,. n values of the hnRNP H1 25/27-mer Dicer-substrate siRNAs 



hnRNP H1 25/27-mer Dicer-substrate siRNAs (DsiRNAs) 



hnRNP H1 target sequence 



■ CTTTGAATCAGAAGATGAAGTCAAATT GG — 3' 



(a) The asymmetric 27-mer duplex RNAs against hnRNP H1 (the site 324 and site 325 DsiRNAs) in this study and the target sequence are listed. The 
sense strand is presented from 5' to 3' and is marked as black and the antisense strand is presented from 3' to 5' and is marked as gray. Two-nucleotide 3'- 
overhangs are underlined. Ribonucleotides are in upper case and deoxyribonucleotides are in lower case. These RNAs are designated by their strand length 
and overhangs: the number indicates the length of RNA strands; "(NN)" means the two-base of 3' ends 



Site 324 DsiRNAs 



Site 325 DsiRNAs 



25 (tg)/27 (CC) 
25 (tg)/27 (UU) 
25 (tg)/27 (GG) 
25 (tg)/27 (AA) 
25 (gg)/27 (CC) 
25 (gg)/27 (UU) 
25 (gg)/27 (GG) 
25 (gg)/27 (AA) 



5 U G A AU C AG A AG AU G A AG U C A A AU tg 3' 
3' CC ACUUAGUCUUCUACUUCAGUUUA AC 
5' UGAAUCAGAAGAUGAAGUCAAAU tg 3' 
3' UU ACUUAGUCUUCUACUUCAGUUUA AC 
5' UGAAUCAGAAGAUGAAGUCAAAU tg 3' 
3' GG ACUUAGUCUUCUACUUCAGUUUA AC 
5' UGAAUCAGAAGAUGAAGUCAAAU tg 3' 
3' AA ACUUAGUCUUCUACUUCAGUUUA AC 

5' G A AU C AG A AG AU G A AG U C A A AU U gg 3' 
3' CC CUUAGUCUUCUACUUCAGUUUAA CC 

5' G A AU C AG A AG AU G A AG U C A A AU U gg 3' 
3' UU CUUAGUCUUCUACUUCAGUUUAA CC 

5' G A AU C AG A AG AU G A AG U C A A AU U gg 3' 
3' GG CUUAGUCUUCUACUUCAGUUUAA CC 

5' G A AU C AG A AG AU G A AG U C A A AU U gg 3' 
3' AA CUUAGUCUUCUACUUCAGUUUAA CC 



psiCHECK assay (RNAi potency) 



Site 324 dsiRNAs (GA-natural match) 



25(tg)/27 (CC) 



25(tg)/27 (UU) 



25(tg)/27 (GG) 



25(tg)/27 (AA) 



(b) IC 50 values of asymmetric 27-mer duplexes were determined using the psiCHECK assay. When the antisense or sense strand was the "guide" strand, their 
target knockdown efficiencies are listed as IC 50 values. The strand selectivity was calculated as the ratio of IC 50 values of sense to antisense strand target 
knockdown 



IC 50 of antisense (pmol/l) 
IC 50 of sense (pmol/l) 
Selectivity of AS to S 
RNAi potency of antisense 
RNAi potency of sense 
Selectivity of antisense to sense 



: 1.08 
: 1.08 



26.56 : 

12.72: 

0.48 

GG > CC > UU, AA 
AA, CC > UU, GG 
GG > CC, UU > AA 



34.24 d 
14.14 d 
0.41 



1.1 
1.08 



16.00: 
16.03: 

1.00 



35.04 ± 1.11 
1 1 .84 ± 1 .08 
0.34 



psiCHECK assay (RNAi potency) 



Site 325 dsiRNAs (AA-natural match) 



25(gg)/27 (CC) 



25(gg)/27 (UU) 



25(gg)/27 (GG) 



25(gg)/27 (AA) 



IC 50 of Antisense (pmol/l) 
IC 50 of Sense (pmol/l) 
Selectivity of AS to S 
RNAi potency of antisense 
RNAi potency of sense 
Selectivity of antisense to sense 



61.54 ±1.09 
56.15 ±1.11 
0.91 

AA > GG > CC > UU 
AA, CC > GG, UU 
AA > GG > CC > UU 



91 .84 ± 1 .08 
296.1 ±1.11 
3.22 



31.82 ± 1.1 
214.3 ±1.11 
6.73 



5.73 ±1.07 
50.77 ± 1 .09 
8.86 



Top 10 sequences 



Site 324 25(tg)/27 (AA) 



Site 324 25(tg)/27 (GG) Site 325 25(gg)/27 (CC) Site 325 25(gg)/27 (GG) 



(c) lllumina Deep sequence analyses of hnRNP H1 DsiRNAs. HCT1 16 cells were transfected with 10 nmol/l of the asymmetric duplexes site 324 25 (tg)/27 
(AA), site 324 25 (tg)/27 (GG), site 325 25 (gg)/27 (CC) and site 325 25 (gg)/27 (GG). The data collection and alignment protocols are described in Materials 
and Methods. The total reads of the top 10 sense and antisense strands from each duplex and the fraction of desired "L-R" cleavage products from each 
strand are listed. To calculate this fraction, "L-R" product reads from sense or antisense strands were divided by the separate reads of each strand 



Total reads of antisense and sense 


158,404 


1 ,322,436 


446,025 


744,339 


Reads of antisense 


91,124 


1,281,110 


431,859 


732,830 


"L-R" products from antisense 


75,601 


1,174,340 


431,859 


675,568 


Percentage of "L-R" products 


82.96% 


91.67% 


100.00% 


92.19% 


Reads of sense 


67,280 


41,326 


14,166 


1 1 ,509 


"L-R" products from sense 


67,280 


41,326 


14,166 


1 1 ,509 


Percentage of "L-R" products 


100.00% 


100.00% 


100.00% 


100.00% 



DsiRNA, Dicer-substrate small interfering RNA; hnRNP H1, heterogeneous nuclear ribonucleoprotein H; siRNA, small interfering RNA. 
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Totalreads of top 10 sequences 
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Figure 7 lllumina Deep sequence analyses of asymmetric 27-mer heterogeneous nuclear ribonucleoprotein H (hnRNP H1) 
Dicer-substrate small interfering RNA (siRNAs).The data collection and alignment were as described above, (a) Total reads of the top 
10 sense and antisense strands from each duplex, (b) The strand distribution was calculated as the ratio of the abundance of antisense 
to sense. Ratio of antisense (AS) to sense (S) is ranked by the 3' overhang GG > AA in site 324 Dicer-substrate small interfering RNAs 
(DsiRNAs) and GG > CC in site 325 DsiRNAs. (c) The dicing pattern L-R and R-L are as previously described. The L-R model generates 
the desired siRNA species for target knockdown. The total reads of the top 10 antisense strands and the abundance of L-R cleavage 
products are presented. 



nucleotide at the 3' end occurred often on the AS strand. Also 
1 or 2 untemplated nucleotides (such as "U," "C," "CC," "UU," 
or "CU") were most frequently added to the 3' end of both 
strands. Although RNA editing appears to be a widespread 
phenomenon among all the tested duplexes, the mechanism 
is unknown. 



Discussion 

The RNAse III family member Dicer initiates RNAi by pro- 
cessing double-stranded RNAs into 21-23 nt double- 
stranded RNAs (either miRNAs or siRNAs) generating a 
two-base 3'-overhang. 7 ' 28 In association with Dicer, the 
siRNA products are loaded into RISC such that only one of 
the original strands is incorporated and used as a guide for 
the sequence-specific post-transcriptional silencing of cog- 
nate genes. The remaining strand, known as the antiguide 



or passenger strand, is degraded. Previous studies have 
demonstrated in mammals that the PAZ domain of Dicer is 
a single-stranded RNA-binding module that has preference 
for two-base single strand overhangs produced by another 
RNAse III family member Drosha, which processes primary 
miRNAs into pre-miRNAs. 6 ' 1 1,29 The PAZ/PIWI domains of the 
Ago2 protein serve as anchors to spatially orient the bound 
RNA substrates in the enzyme active site. 30,31 Therefore, 
DsiRNAs with two-nucleotide 3'-overhangs that are favorable 
for Dicer binding/cleavage and subsequent Ago anchoring are 
believed to enhance RNAi potency 9 Addition of RNA tetral- 
oops, unfavorable DNA residues or fluorescent groups at the 
ends of double-stranded RNAs have been demonstrated to 
partially or completely block cleavage by human Dicer. 6,16 

The single-stranded two-base 3' overhangs present in tra- 
ditional 21-mer siRNAs are required for optimal siRNA func- 
tion. As a regular practice in designing siRNAs, the two-base 
3'-deoxynucleotide overhangs (such as "tt") are often added to 
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19-mer siRNAs, often without regard to complementarity with 
the target sequence. However, here we find that when this fea- 
ture is applied to 25/27-mer DsiRNAs, it adversely affects dic- 
ing and subsequent RNAi activity Moreover, we also find that 
the sequence composition of ribose 3' two-base overhangs 
significantly affect dicing polarity and strand selectivity. 

We first carried out investigations of in vitro dicing products 
using symmetric 25 base pair duplexes with various overhang 
configurations (group I, Table 1). Our results showed that a 3' 
tt overhang attenuates Dicer entry onto the substrate while 3' 
UU overhangs are favorable for Dicing entry and subsequent 
cleavage. When Dicer enters the substrate from either end of 
a duplex ("L-R" or "R-L" models), resulting in heterogeneous 
cleavage products, these can consequently impact on RNAi 
potency. Even though the only minor difference among all the 
symmetric 27-mer DsiRNAs tested is the sequence composi- 
tion of the two-base 3' overhang, the 27 (tt)/27 (UU) DsiRNA 
preferentially generated more "L-R" dicing products generat- 
ing the desired siRNAs with the highest target knockdown 
efficiencies, whereas other duplexes with a 3' tt overhang 
on the AS strand were primarily processed by "R-L" Dicer 
entry generating siRNAs with poor efficacy. Because a 3'=tt 
overhang or a ribose blunt end does not completely abol- 
ish Dicer entry/cleavage in the group I design, bidirectional 
dicing products were still observed. We further restricted the 
dicing preference in order to obtain the desired "L-R" product 
through rational design of 3'-termini. These optimized asym- 
metric DsiRNAs (group II, Table 1) have a single, favorable 3' 
two-ribonucleotide overhang on the AS strand and an unfa- 
vorable two-nucleotide blunt DNA residue on the 3' end of 
the S strand 22 which simplifies the Dicing pattern, predictably 
generating a single, desired "L-R" product, thereby enhanc- 
ing RNAi potency. It is noteworthy that these design features 
can provide an optimal structure for binding by the Dicer 
PAZ domain along with an anchor site for Ago2 in RISC. The 
increased RNAi efficacy mediated by the 27-mer DsiRNAs 
can be attributed to two features (i) the major "L-R" products 
generate siRNAs with desired sequences; and (ii) the "L-R" 
products that have a 3' two-base overhang on the AS strand 
result in preferential utilization of the AS strand as a guide in 
the RNAi machinery. 

Since "L-R" and "R-L" Dicer entry patterns can coexist in 
a dicing reaction, they will compete each other. By block- 
ing "R-L" dicing via the 3'-DNA blunt end on the S strand 
and simultaneously facilitating "L-R" entry via optimized 3' 
two-base ribonuleotide overhangs on the desired AS/guide 
strand, it is possible to significantly promote the desired "L-R" 
dicing products and strand selectivity as previously demon- 
strated. 22 Furthermore, the RNA slicing-based silencing path- 
way is involved in multiple cycles of target binding, cleavage 
and product release mediated by the Argonaute 2 protein. In 
this scenario, the Argonaute 2 protein remains bound to the 
guide strand promoting guide strand selection by slicing the 
passenger strand, thereby establishing and protecting the 
guide strand from degradation. 45 

We have been interested in determining the influence of 
the sequence composition of the two-base 3' overhangs on 
Dicer processing and strand selectivity. We have examined 
the fates of asymmetric TNP03 duplexes in cells by lllumina 
Deep sequencing analyses. Additionally the RNAi potencies 



of both the "S" and "AS" strands derived from these duplexes 
were also evaluated by psiCHECK reporter gene assays. In 
similarity to our in vitro Dicer assays, the asymmetric duplexes 
with two-base 3' ribonucleotide overhangs were predictably 
and primarily processed into the desired "L-R" cleavage prod- 
ucts of 21-22 mers in cells (Table 2 and Supplementary 
Figure S1). 

The effect of the sequence composition of the two-base 3' 
overhang is an important point of discussion. In several previ- 
ous studies, a preference in Dicer binding and Dicer-cleavage 
efficiency for DsiRNAs containing purine/purine nucleotide 
3' overhangs over pyrimidine/pyrimidine 3'-overhangs was 
observed. Dicer binding and activity was ordered by the 3' 
overhangs CC > GC > GG > AA > UU, 15 GG > AA > UU > CC 

> tt, 9 and AA > GC » GG > CC > UU. 19 However, other stud- 
ies showed different tendencies. For example, a molecular 
dynamic simulation study 32 indicated that the 3'-UU overhang 
made a relatively more stable complex with the PAZ domain 
compared to 3'-GG, AA, and CC overhangs (UU > GG > AA 

> CC). In addition, an asymmetric 27 mer DsiRNA with a 
3'-UU overhang on the AS strand was shown to be the most 
potent inhibitor of gene expression compared to DsiRNAs 
having different sequence compositions of the 3' overhang 
(UU > GG, GC, CC > AA). 17 ' 22 It was noted that in this case 
the 3'-UU overhang was completely complementary to the 
target, implying a perfectly matched 3'-overhang may contrib- 
ute to RNAi activity. Our data for both the TNP03 and hnRNP 
H1 targets also demonstrated that asymmetric DsiRNAs 
with a 3'-overhang complementary to the target mRNA have 
enhanced RNAi potency. Although the optimal sequence com- 
position of the 3' termini is still controversial overall for RNAi, 
the 3' overhang composition undoubtedly contributes to Dicer 
binding and RNAi potency. Our data further demonstrate that 
the composition of the 3' two-base overhangs significantly 
influences the relative strand abundance and selectivity into 
RISC. Our observation that the most efficacious strands were 
also the most abundant revealed that the relative frequen- 
cies of "S" or "AS" strands are highly correlated with overall 
strand selectivity and hence silencing activity. For example, 
in the TNP03 DsiRNA system, there is a strong bias in the 
abundance of "L-R" products and selection of the AS strand 
with the order of preference for two-base 3' overhang being 
GG > GC, CC, AA > UU » tt. Similarly, the strand distribution 
(Ratio of AS to S) follows the same 3' overhang order of GG 

> GC, AA > CC, UU » tt. Similar observations on biases dic- 
tated by the two-base overhang were also found for another 
set of asymmetric 25/27-mer duplexes targeting the hnRNP 
H1 mRNA further validating the significance of the role of 
the sequence composition of the 3' two-base overhang. With 
both the TNP03 and hnRNP H1 mRNAs, our results support 
the observation that DsiRNAs with purine/purine (GG, AA) 
nucleotide overhangs are generally preferred over pyrimi- 
dine/pyrimidine overhangs (CC, UU). 

In addition to the role of the two-base overhang in DsiRNA 
selection and RNAi, we have observed two types of RNA 
editing on both strands of the processed DsiRNAs. These 
are trimming of a single or double-nucleotide on the 3' end 
and addition of untemplated nucleotides to the 3' or 5' termini 
subsequent to the trimming (Figure 5d and Supplementary 
Figures S1 and S2). The deep sequences reveal that the 
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trimming of the 3' end occurs on both strands following Dicer 
processing. There is more trimming when DNA residues 
were incorporated into the overhangs or blunt end. Moreover, 
the nucleotides most frequently added to these duplexes are 
U and A. The addition of a U or A was much more preva- 
lent on the 3' end than on the 5' end. Interestingly, the high- 
est frequency of RNA editing, which exceeded 50% of the 
sequences analyzed was observed for the TNP03 25 (gc)/27 
(GC) duplex, in which the 3' end of the AS was extended by a 
U or A. Considering the direct relationship between sequence 
abundance and RNAi activity of the AS strand, we conjecture 
that the untemplated nucleotide additions to the 3' end of the 
AS strand somehow facilitate RISC loading. It remains to be 
investigated what enzymes are participating in the RNA edit- 
ing. We do not know if the addition of untemplated nucle- 
otides takes place before or after Dicer cleavage. 

In conclusion, consistent with previous reports, 22 24 our data 
demonstrate that 3' RNA residues are more favorable than 
DNA residues for Dicer processing of DsiRNAs. Dicer entry 
onto DsiRNAs is oriented by the nature of the 3' ends. In addi- 
tion, the entry of Dicer also influences guide strand selection 
and ultimately RNAi potency. Since the PAZ domain is sensi- 
tive to the type of 3' overhang sequence composition, 6 Dicing 
patterns could be predictably controlled through the rational 
design of the 3' end. Furthermore, the sequence composition 
of the 3' two-base overhang appears to influence RNA editing 
of the siRNAs, as well as the Dicing pattern and recruitment 
of the siRNA guide strand into the RISC. Further research will 
be needed to understand the mechanism of RNA recognition 
and editing events in the RNAi pathway. 



Materials and methods 

Materials. Unless otherwise noted, all chemicals were pur- 
chased from Sigma-Aldrich (St Louis, MO), T4 PNK enzymes 
and buffer were obtained from New England BioLabs (Ips- 
wich, MA) and all cell culture products were purchased from 
GIBOC (Gibco BRL/Life Technologies (Carlsbad, CA), a divi- 
sion of Invitrogen, Carlsbad, CA). The cell lines HEK 293 and 
HCT1 16 are from the ATCC (Manassas, VA). Random prim- 
ers (Invitrogen); Lipofectamine 2000 (Invitrogen). 

siRNAs. All the siRNAs were synthesized and purified using 
high-performance liquid chromatography at Integrated DNA 
Technologies (Coralville, IA). All RNA duplexes against 
TNP03 (group I and group II) used in this study are listed in 
the Table 1 . All RNA duplexes against hnRNP H1 used in this 
study are listed in the Table 3a. 

In vitro Dicer assays. All the S strands and AS strands were 
end-labeled with T4 polynucleotide kinase and y- 32 P-ATR 
Unlabeled S or AS RNAs were annealed with equal molar 
amounts of 5'-end-labeled corresponding AS or S strands in 
HBS buffer in order to form siRNA duplexes. siRNA duplexes 
(1 pmol) were incubated at 37 °C for 40 minutes in the 
presence or in the absence of 1 U of human recombinant 
Dicer enzyme following the manufacturer's recommenda- 
tions (Ambion, Austin, TX). Reactions were stopped by phe- 
nol/chloroform extraction and the resulting solutions were 



electrophoresed in a 20% polyacrylamide denaturing gel. 
The gels were subsequently exposed to X-ray film. 
Cell culture. HEK 293 cells and HCT116 cells were pur- 
chased from ATCC and cultured in Dulbecco's modified 
Eagle's medium supplemented with 10% fetal bovine serum 
according to their respective data sheets. Cells were cultured 
in a humidified 5% C0 2 incubator at 37 °C. 

Determination of TNP03 gene silencing (qRT-PCR analysis). 
HEK 293 cells were split in 24-well plates to 60-70% conflu- 
ency in Dulbecco's modified Eagle's medium media 1 day 
before transfection. The cells were transfected with 10 or 50 
nmol/l of experimental anti- TNP03 duplex RNAs using Lipo- 
fectamine 2000 following the manufacturer's recommenda- 
tions (Invitrogen). Forty eight hours post-transfection total 
RNAs were isolated with TriZol reagent (Invitrogen). Expres- 
sion of the TNP03 gene was analyzed by quantitative real 
time-PCR using a 2x iQ SyberGreen Mastermix (Bio-Rad) 
and specific primer sets at a final concentration of 400 nmol/l. 
Primers were as follows: 7A/P03forward primer: 5'-CCT GGA 
AGG GAT GTG TGC-3'; TNP03 reverse primer: 5'-AAA AAG 
GCA AAG AAG TCA CAT CA-3'; GAPDH forward primer: 5' 
-CAT TGA CCT CAA CTA CAT G-3'; GAPDH reverse primer: 
5'-TCT CCA TGG TGG TGA AGA C-3'. 

TriZol reagent was used to extract total RNA according to 
the manufacturer's instruction (Invitrogen). Residual DNA 
was digested using the DNA-free kit per the manufacturer's 
instructions (Ambion). cDNA was produced using 2 ug of total 
RNA, Moloney murine leukemia virus reverse transcriptase 
and random primers in a 15 ul reaction according to the man- 
ufacturer's instructions (Invitrogen). GAPDH expression was 
used for normalization of the qPCR data. 

Illumina Deep sequence and data analysis. HEK 293 cells 
were split into 24-well plates at 60-70% confluency in Dul- 
becco's modified Eagle's medium media one day prior to 
transfection. The cells were transfected with 10 nmol/l of the 
asymmetric group II RNA duplexes using Lipofectamine 2000 
following the manufacturer's recommendations (Invitrogen). 
Forty eight hours post-transfection the total RNAs were iso- 
lated with TriZol reagent (Invitrogen) and prepared for Illu- 
mina Deep sequencing. 

To identify the most frequent S and antisense products 
from each of the DsiRNA molecules, the sequences gen- 
erated from Illumina Pipeline v1.6 were aligned with the S 
and AS strands of each siRNA molecule using Novoalign 
v2.05 (http://www.novocraft.com). All subsequent analyses 
were carried out using the R statistical environment and 
Bioconductor packages "Biostrings" and "ShortRead." 33 Only 
sequences that could be aligned to the siRNA sequences 
without mismatches were retained. The relative starting and 
ending positions of the siRNA sequences were determined 
based on their aligned positions and lengths, and the fre- 
quency of each product was counted. Only the 10 most fre- 
quent products are reported. 

To examine whether there are nucleotide additions or dele- 
tions at either end of the Dicer processed products, the raw 
sequences were matched to the siRNA AS sequence with a 
seed size of 16 after removing the 3'-adapter using the Bio- 
conductor package "ShortRead." For example, for a siRNA 
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sequence length of 23, the lllumina sequences were aligned 
totally to eight seeds which are the subsequences from bases 
1-16, 2-17, and so on, of the original siRNA sequence. The 
matched sequences were then reduced to a set of unique 
sequences along with their number of occurrences. This set 
of sequences was then aligned with the siRNA reference 
sequence using the ClustalX2 multiple alignment tool 34 not 
allowing gaps. The multiple aligned sequences were visual- 
ized and exported using JalView. 35 The extra bases at either 
end of the product were highlighted manually. 

Dual luciferase assay (detection of IC 50 value). The 45 base 
pair oligomer of TNP03 cDNA was inserted into the Spel 
and Xho\ restriction endonuclease sites downstream of 
the humanized Renilla luciferase gene in the psiCHECK-2 
vector (Promega, Fitchburg, Wl) to generate plasmids psi- 
CHECK-7A/P03-AS (passenger strand reporter) and psi- 
CHECK- TNP03- S (guide strand reporter). HCT116 cells 
were cotransfected in a 96-well format (25,000 cells/well) 
with 10 ng of the respective psiCHECK-7A/P03-AS or psi- 
CHECK- TNP03- S vector, 100 fmol/l-50 nmol/l DsiRNAs 
and 0.1 ul Lipofectamine2000 (Invitrogen) per well. Cells 
were lysed in 1x Passive Lysis Buffer (Promega) 24 hours 
after transfection and analyzed using the Dual-Luciferase 
Reporter System (Promega) on a Veritas microplate lumi- 
nometer (Turner Biosystems, Sunnyvale, CA).The average 
values were calculated from three replicates to set Renilla/ 
Firefly luciferase expression to 100%. An IC 50 curve was 
generated using Prism 5.01 software (GraphPad, La Jolla, 
CA). Sigmoidal dose responses were calculated according 
to Y= Bottom + (Top - Bottom)/(1 + 10) V((LogEC 50 - X)); 
where X is the logarithm of concentration and Y is the 
response. 

The sequences of Oligomers. 
TNP03 AS 

TNP03 AS_S: 5'-CCg CTCGAG ggagcaaagc cgacattgca 

gctcgtgtac caggc agtgc aggcg ACTAGT CC-3'; 

TNP03 AS_AS: 5'-GG ACTAGT cgcct gcactgcctg gtacac- 

gagc tgCaatgtcg gctttgctcc CTCGAG CGG-3' 

TNP03 sense(S) 

TNP03 S_S: 5'-CCG CTCGAG cgcct gcactgcctg gtacacgagc 
tgCaatgtcg gctttgctcc ACTAGT CC-3'; 
TNP03 S_AS: 5'-GG ACTAGT ggagcaaagc cgacattgca 
gctcgtgtac caggcagtgc aggcg CTCGAG CGG-3' 
The fragment of 343 base pair in hnRNP H1 cDNA includes 
the region of bases 90-432 in the reference sequence 
NM_005520. The reporter plasmids ps\-hnRNP H-S (sense 
reporter) and ps\-hnRNP H-AS (AS reporter) were derived by 
cloning the hnRNPH sequences in the 3'-UTR of the human- 
ized Renilla luciferase gene in the psiCHECK-2 (Promega). 22 

Supplementary material 

Figure S1. lllumina Deep sequence analysis of asymmetric 

TNP03 27-mer Dicer-substrate siRNAs (group II). 

Figure S2. lllumina Deep sequence analyses of asymmetric 

27-mer hnRNP H1 Dicer-substrate siRNAs. 

Figure S3. Silencing of hnRNP H1 by the site 324 and 325 

duplex DsiRNAs. 
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